Improve quantile performance #86

bkamins · 2021-09-08T08:29:57Z

Fixes #84

codecov · 2021-09-08T08:32:31Z

Codecov Report

Merging #86 (f8015c8) into master (54f9b0d) will decrease coverage by 1.96%.
The diff coverage is 75.00%.

@@            Coverage Diff             @@
##           master      #86      +/-   ##
==========================================
- Coverage   98.18%   96.22%   -1.97%     
==========================================
  Files           1        1              
  Lines         386      424      +38     
==========================================
+ Hits          379      408      +29     
- Misses          7       16       +9

Impacted Files	Coverage Δ
src/Statistics.jl	`96.22% <75.00%> (-1.97%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 54f9b0d...f8015c8. Read the comment docs.

bkamins · 2021-09-08T08:32:47Z

I still need to add correctness tests.

bkamins · 2021-09-08T08:59:59Z

@nalimilan - doing an "additional sorting" is not correct unfortunately (the second sorting destroys the first), so we need to do this independently unfortunately (still it should not be that bad).

src/Statistics.jl

nalimilan · 2021-09-08T10:06:28Z

src/Statistics.jl

@@ -937,22 +937,33 @@ function quantile!(q::AbstractArray, v::AbstractVector, p::AbstractArray;
    end
    isempty(q) && return q

-    minp, maxp = extrema(p)
-    _quantilesort!(v, sorted, minp, maxp)
+    if length(p) == 2


Special-casing 2 is kind of weird. For example, for [0.25, 0.5, 0.75] this branch would probably also be faster, right? Actually, isn't this approach faster than the other in most cases?

A possible optimization would be to call partialsort! on the full array for the first quantile, then call it on a view from the first quantile to the end of the array for the second, and so on. Not sure that would make a big difference, but at least it shouldn't be really slower than sorting everything between the two extreme quantiles, right?

Yes, it is faster:

julia> using Statistics julia> using BenchmarkTools julia> x = rand(10^4); julia> @btime quantile($x, [0.25, 0.5, 0.75]); 410.400 μs (4 allocations: 78.42 KiB) julia> @btime [quantile($x, 0.25), quantile($x, 0.5), quantile($x, 0.75)]; 314.000 μs (7 allocations: 234.72 KiB)

The problem is that quantiles do not need to be sorted, so this complicates the code (but of course is doable as I guess sorting requested quantiles should not be problematic for performance).

We do not need views I think, as sort! supports passing start and stop ranges.

However, as it seems to be a bigger rework I will leave it for after DataFrames.jl 1.3 releaes when we work on general Statistics update.

OK. Yes, sorting quantiles should have a negligible cost (and most of the time they will already be sorted).

Do you feel like finishing this?

I have opened https://github.com/JuliaLang/Statistics.jl/pull/91 to perform easy comparison of both.

bkamins added 2 commits September 8, 2021 10:29

Improve quicksort performance

89ba1cb

Update Statistics.jl

5b8e75d

bkamins commented Sep 8, 2021

View reviewed changes